Clinic-Genomic Association Mining for Colorectal Cancer Using Publicly Available Datasets
Authors
Abstract
In recent years, a growing number of researchers have begun to focus on how to establish associations between clinical and genomic data. However, research that mines clinic-genomic associations by comprehensively analysing the available gene expression data for a single disease is still lacking. Colorectal cancer is a malignant tumour, and a number of genetic syndromes have been proven to be associated with it. This paper presents our research on mining clinic-genomic associations for colorectal cancer in a biomedical big data environment. The proposed method combines multiple technologies: extracting clinical concepts using the Unified Medical Language System (UMLS), extracting genes through literature mining, and mining clinic-genomic associations through statistical analysis. We applied this method to datasets extracted from both the Gene Expression Omnibus (GEO) and the Genetic Association Database (GAD). A total of 23,517 clinic-genomic associations between 139 clinical concepts and 7,914 genes were obtained, of which 3,474 associations between 31 clinical concepts and 1,689 genes were identified as highly reliable. Evaluation and interpretation were performed using UMLS, KEGG, and Gephi, and potential new discoveries were explored. The proposed method is effective in mining valuable knowledge from available biomedical big data and performs well in bridging clinical data with genomic data for colorectal cancer.
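The abstract does not say which statistical test is used to mine associations. As an illustration only, the sketch below assumes a two-group comparison: samples are split by whether a clinical concept applies, and each gene is scored with Welch's t statistic. The gene names, expression values, and the `rank_genes` helper are hypothetical and are not taken from the paper.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic for two independent samples (unequal variances)."""
    return (mean(a) - mean(b)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


def rank_genes(expression, has_concept):
    """Rank genes by |t| between samples with and without a clinical concept.

    expression:  dict mapping gene name -> list of expression values (one per sample)
    has_concept: list of bools (one per sample), True if the concept applies
    """
    scores = {}
    for gene, values in expression.items():
        with_c = [v for v, flag in zip(values, has_concept) if flag]
        without_c = [v for v, flag in zip(values, has_concept) if not flag]
        scores[gene] = welch_t(with_c, without_c)
    # Largest absolute statistic first: strongest candidate association.
    return sorted(scores.items(), key=lambda kv: abs(kv[1]), reverse=True)


# Hypothetical toy data: six samples, the first three annotated with the concept.
expression = {
    "GENE_A": [5.1, 5.3, 4.9, 2.0, 2.2, 1.9],  # differs strongly between groups
    "GENE_B": [3.0, 3.1, 2.9, 3.0, 3.2, 2.8],  # no group difference
}
has_concept = [True, True, True, False, False, False]
ranked = rank_genes(expression, has_concept)
```

In a real analysis the statistics would be converted to p-values and corrected for multiple testing across thousands of genes before any association is called reliable; that step is omitted here.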
Similar resources
Data Mining for Identification of Forkhead Box O (FOXO3a) in Different Organisms Using Nucleotide and Tandem Repeat Sequences
Background: Deregulation of the FOXO3a gene, which belongs to the Forkhead box O (FOXO) family of transcription factors, can cause cancer (e.g. breast cancer). FOXO factors have an important role in ubiquitination, acetylation, de-acetylation, protein-protein interactions and phosphorylation. Understanding the regulation and mechanisms of FOXO3a can lead to cancer treatment. The aim of this study recent association...
MetaBioME: a database to explore commercially useful enzymes in metagenomic datasets
Microbial enzymes have many known applications as biocatalysts in biotechnology, agriculture, medical and other industries. However, only a few enzymes are currently employed for such commercial applications. In this scenario, the current onslaught of metagenomic data provides a new unexplored treasure trove of genomic wealth that can not only enhance the enzyme repertoire by the discovery of n...
Investigating the methylation status of DACT2 gene and its association with MTHFR C677T polymorphism in patients with colorectal cancer
Colorectal cancer (CRC) is one of the common causes of cancer death in the Iranian population. Both genetic and epigenetic changes have been implicated in CRC pathogenesis. The DACT2 gene, an inhibitor of the WNT signaling pathway, has been shown to display tumor suppressor activity in many cancers. The aim of the present study was to investigate the methylation status of DACT2 gene and its ...
Use of Genome-Wide Association Studies for Cancer Research and Drug Repositioning
Although genome-wide association studies have identified many risk loci associated with colorectal cancer, the molecular basis of these associations is still unclear. We aimed to infer biological insights and highlight candidate genes of interest within GWAS risk loci. We used an in silico pipeline based on functional annotation, quantitative trait loci mapping of cis-acting gene, PubMed text-...
Association between alcohol, dietary factors and subsites of colorectal cancer: an ecological study
Background: Colorectal cancer is the fourth most common cancer worldwide in terms of incidence. The risk factors involved in tumor incidence differ between anatomical subsites of the large bowel. However, most investigations have not studied the association between dietary factors and colorectal cancer subsites. Thus the current ecological study inves...
Journal title:
Volume 2014, issue
Pages: -
Publication date: 2014